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The question what determines the structure of natural food webs has been listed 
among the nine most important unanswered questions in ecology^. It arises nat- 
urally from many problems related to ecosystem stability and resilienceii. The 
traditional^ view isi that population-dynamical stability is crucial for understand- 
ing the observed@flSS00 structures. But phylogeny (evolutionary history) has also 
been suggested^ as the dominant mechanism. Here we show that observed topologi- 
cal features of predatory food webs can be reproduced to unprecedented accuracy by 
a mechanism taking into account only phylogeny, size constraints, and the heredity of 
the trophically relevant traits of prey and predators. The analysis reveals a tendency 
to avoid resource competition rather than apparent competitionB. In food webs with 
many parasitesS this pattern is reversed. 

Empirical food-web data is notorious for its inhomogeneityS. In particular, the large 
number of species interacting in habitats has forced researchers to disregard whole subsys- 
tems or to coarsen the taxonomic resolution0. The representation of trophic interactions 
by the simple absence or presence of links in topological food webs is problematic, be- 
cause it turns out that by various measuresEl weak links are more frequent than strong links 
in natural food webs, and network structures depend on a somewhat arbitrary threshold- 
ing among the weak linksBE. Furthermore, the use of different methods for determining 
links0 might affect the result. Our analysis takes these difficulties into account by employ- 
ing a quantitative link-strength concept, an appropriate data standardization (see Supple- 
mentary Methods), and by reflecting the inhomogeneity of empirical methodology in our 
food- web model and data analysis. 
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Specifically, the following model ("matching model") describing the evolution of an ab- 
stract species pool is employed: The foraging and vulnerability traits of each species000 
are modeled by two sequence of ones and zeros of length n (the reader might think of 
oppositions such as sessile/vagile, nocturnal/diurnal, or benthic/pelagic). The strength of 
trophic links increases (nonlinearly) with the number m of foraging traits of the consumer 
that match the corresponding vulnerability traits of the resource (Figure [l|). A trophic link is 
considered as present if the number of matched traits m exceeds some threshold m > niQ. 
In addition, each species is associated with a size parameter s characterizing the (logarith- 
mic) body size of a species (0 < s < 1). Consumers cannot forage on species with size 
parameters larger than their own by more than A. The model parameter A (0 < A < 1) 
controls the amount of trophic loopsiil in a food web. 

The complex processes driving evolution are modeled by speciations and extinctions 
that occur for each species randomly at rates r+ and r_, respectively^. New species invade 
the habitat at a rate ri. Such continuous-time birth-death processes are well understood^. 
With r+ < r_ the steady-state average of the number of species is ri/ (r_ — r+). For new, 
invading species the 2n traits and the size parameter s are determined at random with equal 
probabilities. For the descendant species of a speciation (Figure [1]), each vulnerability 
trait is flipped with probability p^, each foraging trait is flipped with probability pf, and a 
zero-mean Gaussian random number 5 (var 5 = D) is added to the size parameter s of the 
predecessor (s = 0, 1 are treated as reflecting boundarieS). Such a random, undirected 
model of macroevolution becomes plausible if one assumes the trophic niche space to be 
in a kind of "occupation equilibrium": there are no large voids in niche space to be filled 
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and no niche- space regions of particularly strong predation pressure to avoid. 

The model has the adjustable parameters r+, r_, ri, A, mo, Pw, Pu and D. For large n 
food-web dynamics become independent of n, provided niQ is adjusted such as to keep the 
probability Co for link strengths to exceed the threshold constant (Supplementary Discus- 
sion). Throughout this work n = 256 is used. Figure |^ and Supplementary Figures 0-[T^ 
display the connection matrices of randomly sampled steady-state model webs in compar- 
ison with empirical data; a Supplementary Movie illustrates the model dynamics. 

The model was validated by comparing snapshots of the steady state with empirical 
data. Thus, only the relative evolution rates ri/r_ and r+/r_ matter. We set r_ = 1. 
The size-dispersion constant D has only a weak effect on results (for not too large A, D, 
only the ratio A/-D^/^ is relevant!^, as we verified numerically for D = 0.05/4) and was 
kept fixed at D = 0.05. The remaining six parameters r+, ri. A, uiq, p^, and pf were 
chosen such as to fit 14 ecologically relevant, quantitative food- web properties to empirical 
data (Supplementary Figure [T|), separately for each of 17 well-studied data sets (maximum 
likelihood fits, see Supplementary Methods for properties, sources of food-web data, and 
fitting procedure). Results are listed in Table |T]. Each fitted parameter set required ~ 10^ 
statistically independent Monte-Carlo simulations. 

To quantify the goodness-of-fit we computed statistics (Supplementary Methods) 
corresponding to the remaining 14 — 6 = 8 statistical degrees of freedom (DOF) for each 
data set (Table [1|, Xm)- Not all empirical food-webs are fitted equally well. For the three 
food webs labeled Scotch Broom, British Grassland, and Ythan Estuary 2 the value of 
exceeds the Bonferroni-corrected 95%-confidence interval x"^ < 23.0 (15 webs). Discrep- 
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ancies between the remaining 14 data sets and the model, on the other hand, are revealed 
only when pooling all 14 sets: J^X^ = 173 for 112 DOF gives p = 2 ■ 10~^. 

For comparisons, the niche model! (one of the best description known so far) was fit- 
ted to the data using the same procedure (Supplementary Methods), and the differences 
AAIC = AICm — AICn of the Akaike Information Criterion for fits to the matching model 
(AICm) and the niche model (AICn) were computed. This statistic takes the fact into ac- 
count that the matching model is more complex and contains four parameters more than the 
niche model. Negative AAIC indicate that the matching model describes the data better 
than the niche model and the increased model complexity is justified. The value is negative 
for 12 out of 17 models. In the cases where AAIC > this is due to unnecessary complex- 
ity of the matching model, and not due to a better fit of the niche model, as a comparison 
of the corresponding values (Table [T], niche model: Xn) shows. Pooling all data yields 

AAIC = —576 in favor of the matching model. A comparison of the nested hierarchy 
model0ilii with our model gives similar results (^ AAIC ^ -1480). 

Among the fitted model parameters some depend just as much on methodological 
choices at the time of recording the food web as on the actual ecology. In particular the 
linking probability Cq directly corresponds to the threshold for link assignment, and the 
invasion rate ri — as a parameter determining the web size — depends on the delineation of 
the habitat and the species-sampling effort. The degree of loopiness A might depend on the 
particular method used to determine links empirically. Adjusting these three parameters 
makes the model robust to differences in empirical methodology. 

The remaining three parameters r+, p^, and pf allow, at least partially, an ecological 
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interpretation. r+ = r^jr^ represents the fraction of species that entered the species pool 
by speciations from other species in the pool, in contrast to the remaining 1 — r+ that 
entered through random "invasions". The low values found for 1 — r+ (Table |T]) indicate 
that evolutionary processes are essential for generating the observed structures. 

The two quantities Pv and pf measure the variability of vulnerability and foraging traits 
among related species. We typically find pv much smaller than j9f (Table p. In particular, 
Pv < Vi in 14 of 17 data sets (p = 0.006). This implies that descendant species tend 
to acquire resources sets different from their ancestors but mostly share their enemies. 
We interpret this as a preference for avoiding resource competition rather than apparent 
competitionEl: A typical consumer is an expert for its particular set of resources (resource 
partitioning), and a typically resource set consists of a few "families" of related species — 
autotrophs or, again, expert consumers. 

The three exceptional data sets with Pv/Pf > 1 are exactly those most difficult to fit 
by the matching model (Table |T]). Interestingly, these are also the three data sets that con- 
tain large fractions (> 30%) of parasites, parasitoids, and pathogens (PPP) in the resolved 
species pool. The other data sets are dominated by predators, grazers, and primary pro- 
duces (PPP fraction < 5%). These observations are consistent with the expectations that 
(i) due to their high specialization PPP are less susceptible to resource competition than 
predatorsO and (ii) the matching model does not describe PPP well because it assumes a 
size ordering which is typical only for predator-prey interactiond^BiS. But further in- 
vestigations of these points are required. For example, contrary to expectations, Pv/Pf is 
close to one also for Ythan Estuary 1. 
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The matching model reproduces the empirical distributions of the numbers of consumers 
and resources of species well (Figure^, Supplementary Figures ^p^. Under specific con- 
ditions (see Supplementary Discussion) — including <^ pf — these become the "univer- 
sal", scaling distributions^ characteristic for the niche model (e.g.. Figure |3|, Caribbean 
Reef). But the distributions for food webs deviating from these patterns are also reproduced 
(e.g.. Figure ^ Scotch Broom). An earlier variant of the matching modeli could achieve 
this only under unrealistic assumptions regarding the allometric scaling of evolution rateJ^. 

Certainly there are also features of food webs that can only be understood by taking 
population dynamics explicitly into account. But, in view of the high accuracy reached 
with the matching model, careful modeling of phylogeny0lil@ should be a good starting 
point for further research. 
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TABLE 1: Goodness of fit and fitted parameters 

Food- web name Xm Xn AAIC ri 1 — r+ A mo Co Pv pf 

Benguela Current \A.\ 14.1 11.6 2.8 0.080 0.027 131 0.38 0.003 0.32 

Bridge Brook Lake n.O 12.4 15.2 1.4 0.033 0.13 136 0.17 0.000 0.068 

British Grassland 54.1 * IU.6 * -80.1 1.4 0.014 139 0.09 0.014 0.013 

Canton Creek 11.3 12.1 6.4 1.7 0.033 0.001 141 0.06 0.006 0.50 

Caribbean Reef 7.5 79.1 * -52.8 0.48 0.0082 0.068 133 0.29 0.008 0.39 

Chesapeake Bay 9.5 9.6 10.7 11.5 0.25 0.001 138 0.12 0.000 0.028 

Coachella Valley 5.9 31.9 * -13.0 1.4 0.049 0.034 124 0.71 0.002 0.10 

El Verde RainforestU.O 337.6 * -295.9 0.76 0.0054 0.12 139 0.09 0.015 0.036 

Little Rock Lake 11.3 85.5 * -46.8 1.3 0.0092 0.25 138 0.12 0.001 0.043 

Northeast US Shelf n.S 103.6* -73.9 0.28 0.0033 0.005 131 0.38 0.009 0.059 

Scotch Broom 25.8 * 83.3 * -42.5 1.3 0.0067 0.001 144 0.03 0.031 0.006 

SkipwithPond 14.3 39.9 * -10.3 1.4 0.045 0.033 130 0.43 0.011 0.12 

St. Marks Seegrass 19.3 37.1 * -3.5 0.55 0.0095 0.015 136 0.17 0.025 0.18 

St. Martin Island 7.4 13.7 -11.3 5.7 0.12 135 0.21 0.002 0.32 

Stony Stream 14.4 18.5 -3.5 0.13 0.0033 10-^41 0.06 0.014 0.35 

Ythan Estuary 1 20.5 42.0* -2.9 1.0 0.010 0.0029 140 0.08 0.033 0.037 

Ythan Estuary 2 46.4 * 46.6* 16.1 2.7 0.017 0.0022 141 0.06 0.041 0.038 

Stars (*) denote values outside the Bonferroni-corrected 95% confidence interval. 
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Figure Legends 

I - mm 

FIG. 1 : The main components of the matching model 

Each species (i, j, k) is characterized by n foraging and n vulnerability traits and a size parameter. 
Typically consumers (i) are larger than their resources (J). If the number m of matches between a 
consumer's foraging traits and a resource's vulnerabilities is large, trophic links result. In specia- 
tions (j — > k) some traits mutate. Foraging traits typically mutate more frequently than vulnerability 
traits. See text for details. 
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FIG. 2: Comparison between model steady state and empirical data 

The connection matrix of the Caribbean Reef web (red box) is compared to the matrices of 1 1 
random steady-state webs generated by the matching model (parameters as in Table [l]). Each black 
pixel indicates that the species corresponding to its column eats the species corresponding to its 
row. Diagonal elements correspond to cannibalism. Pixel sizes vary due to varying webs sizes. For 
better comparison, data are displayed after standardization, a random permutation of all species, 
and a subsequent re-ordering such as to minimize entries in the upper triangle. Characteristic are, 
among others, the vertically stretched structures0 reflecting the strong inheritance of consumer sets. 
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FIG. 3: Food- web degree distributions 

Cumulative distributions for the number of resources (upper panels) and consumers (lower panels) 
of species for the Caribbean Reef and Scotch Broom webs after data standardization. Points denote 
empirical data, solid and dotted lines model averages for matching and niche model, respectively. 
2c7-ranges are indicated in green (matching model) and grey (niche model), olive at overlaps. Model 
parameters as in Table |T| 
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Supplementary Methods 



A. Food- Web Data 

The food-web data base used in this work was provided by N. D. Martinez and his 
team. The following are references the original sources: Benguela CurrenE, Bridge Brook 
LakM, British Grassland, Canton CreelM, Caribbean ReeJ^, Chesapeake Ba^, Coachella 
Valle^, El Verde Rainforest Little Rock LakM, Northeast US 5/ze//0, Scotch Broon^^ 
Skipwith Pon<^, St. Marks Seegras^, St. Martin Island, Stony StreanB, Ythan Estu- 
ary Ythan Estuary 



B. Statistical Analysis 

I. Data standardization 

Both empirical and model data were evaluated/compared after applying a data standardiza- 
tion procedure to the raw data. The procedure consists of three steps: 

1. Deleting disconnected species and small, disconnected sub-webs. Graph theory pre- 
dicts that there will be only a single large connected component. We keep only this 
large component. 

2. Lumping of all species at the lowest trophic level into a single "trophic species". 
We do this, because in some data sets the lowest trophic level is already strongly 
lumped. For example, the Chesapeake Bay web contains a species "phytoplankton", 
and Coachella Valley "plants/plant products". On the other hand, food webs such as 
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Little Rock Lake resolves the phytoplankton at the genus level. Lumping the lowest 
level improves data intercomparability. 

3. The usual lumping of trophically equivalent species into single "trophic species"0. 

For some data sets with a simple structure, this procedure leads to a considerable reduction 
of the web size (e.g., Bridge Brook Lake shrinking from 74 species to 15). But generally 
this is not the case. 

2. Food-Web Properties 

Besides the number of species S and the number of links L expressed in terms of the 
directed connectancel C = L/S'^ , the following 12 food- webs properties were used to 
characterize and compare empirical and model webs: the clustering coefficient§lii (Clust 
in Supplementary Figure |1[); the fractions of cannibalistic species@ (Cannib) and species 
without consumer s0 (T, top predators); the relative standard deviation in the number of 
resource species0 {GenSD, generality s.d.) and consumer S0 (VulSD, vulnerability s.d.); 
the web average of the maximum of a species' Jaccard similarityil with any other species^ 
(MxSim); the fraction of triples of species with two or more resources, which have sets of 
resources that cannot be ordered to be all contiguous on a line§ (Ddiet); the averageO 
(aChnLg), standard deviation! (aChnSD), and average per-species standard deviation0 
(aOmniv, omnivory) of the length of food chains, as well as the log^Q of their total number! 
(aChnNo), with the prefix a indicating that these quantities were computed using the fast, 
"deterministic" Berger-Shor approximation^ of the maximum acyclic subgraph (MAS) of 
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the food web. The number of non-cannibal trophic links not included in the MAS was 
measured as aLoop. When the output MAS of the Berger-Shor algorithm was not uniquely 
defined, the average over all possible outputs was used. 

All food- web properties were calculated after data standardization as described above. 



3. Goodness-of-fit statistics 

Mean and covariance matrix of the food-web properties described above, including C 
but not S, were computed for the model steady state and projectedil to S fixed at the 
empirical value. The corresponding log-likelihood and the of the empirical values were 
computed thereof assuming Gaussian distributions. See Ref. P5]for details. 



4. Parameter Fitting 

The fitting parameters listed in Table [T| (except ri) were chosen such as to maximize 
the log-likelihood computed as described above (maximum likelihood estimates). Given 
the other parameters, ri was always adjusted such as to make the model expectation value 
of S, determined from Monte Carlo simulations, match the empirical value. The Akaike 
Information Criterion follows directly from the log-likelihood of the best-fitting parameter 
set. 

In order to compute a comparable Akaike Information Criterion for the niche model, 
some modification of the original prescription for this model0 were required: 



We applied the data standardization ([Supplementary Methods B T|) to both model and 
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empirical data, 

• determined the niche-model parametei@ [3, which controls the connectance, by a 
maximum-likelihood estimate as above, and 

• determined the number of species of model webs before data standardization such as 
to match the expected number of species after data standardization with the empirical 
data, just as described above for the parameter ri of the matching model. 
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Supplementary Discussion 

A. Derivation of Link Dynamics for Large n 

Here we explain why the network dynamics of the matching model becomes indepen- 
dent of n for large n, if mo is properly adjusted as n increases. First, consider a single 
trophic link from a (potential) consumer to a (potential) resource. Denote the foraging 
traits of the former by fi, the vulnerability traits of the latter by Vi, where i = 1, . . . , n and 
fi, Vi e {0, 1}. 

1. Linking Probability 

Consider the steady-state distribution of the link strength m defined by 

1 if fi^Vi 

m = J2\ ■ (1) 

-1 if/.7^^,J 

Since the fi and Vi are equally, independently distributed, m follows a binomial distribution 
with mean n/2 and standard deviation a — The probability for a link to exceed the 

threshold mo is 

Co:^P{m>mO)^2- Q. (2) 

m=mo 

The distribution of x := {m — n/2) /a converges to a standard normal distribution of large 
n. The linking probability Co converges to a fixed value (27r)~^/^ exp(— 1^/2) dt if mo 
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is adjusted such that (mo — n/2)/a converges to a fixed value xq. 
2. Mutation as an Integrated Omstein-Uhlenbeck Process 

In the following we argue that the dynamics of x between speciations can be charac- 
terized as an integrated Omstein-Uhlenbeck process if n is large. First, consider only a 
single link, as above. When the resource speciates, its vulnerability traits are inherited by 
the descendant species, but with probability py they flip from i>j to 1 — Vj. If Pv < 1/2 this 
single step can be divided into a series of K small steps, where a property Vi is flipped in 
each step with a small probability q and otherwise left unchanged. Taking the possibility 
that properties are flipped repeatedly into account, one finds that the K small steps are 
equivalent to the speciation step if 

Pv = ^[l-(l-2gn (3) 

or 

, = -Ml^ + 0(;i-^). (4) 

For sufficiently large K one has -C 1. Then at most one trait is flipped in each step, 
and the change inx — {m — n/2)/a is of order ~ As n increases, it becomes 

arbitrarily small. 

Denote the value of m after the k-th step by m^. At each step, if ruk is known, the 
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probability distribution of nik+i depends only on n and m^. If qn <^ 1, for example, one 
has nik+i = nik — 1 with probability irik q, mk+i = ruk + 1 with probability (n — m) q, and 
otherwise ruk+i = rrik. Thus the dynamics of m — and of x — from step to step are Markov 
processes. 

These three properties of the step-by-step dynamics of x in the limit of large n and K 

1. normal distribution in the steady state 

2. Markov property 

3. arbitrarily small changes from step to step 
identify the dynamics as an Omstein-Uhlenbeck processB 



dx{T) = -fix{T) dt + i]dW{T), (5) 



where W{t) is a Wiener processQ and t = k/K.ln particular, one finds 



/i = -log(l - 2pv), r/ = V2/i. (6) 



The value of x for a link from a speciating resource to its consumer is given by the integral 
of Eq. over a r-interval of unit-length, starting with the value of x for the ancestor. 
This implies that of the correlation of x between direct relatives is (1 — 2pv) and between 
relatives of l-th degree (1 — 2pv)'- The corresponding results for a speciating consumer are 
obtained by replacing py in Eq. @ by pf. 
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For the inheritance of several links to unrelated (hence uncorrelated) consumers, Eq. @ 
holds for each link, and the Wiener processes are uncorrelated. For links to unrelated 
resources correspondingly. For links to related species the Wiener processes are correlated. 
From invariance considerations regarding the temporal ordering of evolutionary events in 
local networks one finds that for relatives of l-th degree this correlation is (1 — 2pf)^ for 
species-as-consumers and (1 — 2pv)' for species-as-resources. The correlations between 
links to related species from a newly invading species also follow this pattern. This provides 
a full characterization of the link dynamics for large n independent of n. 

B. Relation to Previous Analytic Results 

In order to make the analytic characterizations of the degree distributions and other 
food-web properties obtained for an earlier model varianti accessible for the matching 
model, we derive an approximate description of the link dynamics that refers directly to 
the inheritance of connectivity between species, i.e., of the information if a link is present 
or not, rather than the inheritance of traits determining links. 

Mathematically, this corresponds to a Markov approximation for the dynamics of the 
connectivity in the following form: If resource B speciates to C, its connectivity informa- 
tion to a consumer A is lost with a probability (independent of the previous history) 
and otherwise copied from B to C. When the information is lost, a link from C to A is 
established at random with probability Cq. 

The breaking probability /3v can be obtained by equating the probabilities A eats C given 
that A eats C's ancestor B for the exact description (in terms of and mo) and the Markov 
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approximation. This gives 



1 " n-mi mo-l ■ 2k+mi-m2 {1 „ \n-2k-mi+m2 

Q = - V V V ^^-p-> p. 

2" (1- Co) Co ^ ^ ^ k\{k + mi-m2)\im2-k)\{n-mi-k)\ ' 

^ ' ?ni=?no fc=0 m2=k ^ / \ / \ / 



with Co defined by Eq. (g). The corresponding expression for Pf is obtained by replacing 
Pv in Eq. by pf. Results of Ref. ^ can be applied to the matching model with the 
replacement of the parameter (3 in Ref. ||by /?v. 

Most analytic results of Ref. || rely on the unrealistic assumption of consumers evolving 
much slower than their resources. This assumption is used to argue for 

1. fully developed correlations of connectivity from one consumer to related resources 
and 

2. absence of correlations for connectivity from one resource to related consumers. 

Effects 1 and 2 are then used to simplify calculations. In the matching model 1 and 2 can 
be obtained without assuming large differences in speciation rates: Effect 1 is obtained 
because statistical correlations in connectivity to related resources in the matching model 
depend only on the correlations between the traits of the resources, and not on the evolu- 
tionary history of the consumer (see also [Supplementary Discussion A[). The correlations 



are large if is small and, as a result, /3v is small. Effect 2 is obtained when pf is close to 
0.5 (foraging traits are randomized in speciations), which implies that Pf is close to 1. 

Results of Ref. || that contribute to a better understanding of the matching model in- 
clude the derivation of the conditions under which the degree distributions become those 
of the niche model, and the explanation why model webs, just as empirical datJ, exhibit a 
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larger-than-random degree of "intervality". The average number of resource "families" (or 
"clades") of a consumer in the matching model can also be estimated, and turns out to be 
small: The largest value (3.7) is obtained for the top predator of Ythan Estuary 2. For most 
other webs this number is below two. 



Gardiner, C. W. Handbook of Stochastic Methods (Springer, Berlin, 1990), 2nd edn. 
^ Rossberg, A. G., Matsuda, H., Amemiya, T. & Itoh, K. Some Properties of the Speciation Model 
for Food- Web Structure — Mechanisms for Degree Distributions and Intervality. J. Theor. Biol. 
(2005). In press. 

^ Cohen, J. E., Briand, F. & Newman, C. M. Community Food Webs: Data and Theory, vol. 20 of 
Biomathematics (Springer, Berlin, 1990). And references therein. 
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Supplementary Movie Legend 



(The movie can be found at |http : / / ag . rossberg . net /matching . mpg| or 



http : / / www . envcomplex . ynu . ac . jp/mat ching . mpg.) 



This 1 minute movie (MPEG, 7MB) illustrates the dynamics of the matching model. 
The movie shows the evolution of the connection matrix of food webs in the model steady 
state at parameters corresponding to Little Rock Lake (Table [l]). Each black pixel indi- 
cates that the species corresponding to its column eats the species corresponding to its 
row. Diagonal elements correspond to cannibalism. To ensure temporal continuity, the 
raw data — prior to data standardization — are show. Thus, these matrices are not directly 
comparable to the matrices displayed in Figure ^ and Supplementary Figures @-|18|. Species 
are sorted by decreasing size parameter s from top to bottom and left to right. The movie 
shows one evolutionary event (speciation, extinction, invasion) per frame at 25 frames per 
second. 
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Supplementary Figure Legends 

A. Results for Food- Web Properties 

In Supplementary Figure |1] the best fitting results for the matching model (red starts) 
and for the niche model (blue boxes) are compared to the empirical data (horizontal lines). 
Vertical lines correspond to ± one model standard deviation. Because the properties are 
computed conditional to fixed S, the value of S always fits exactly. Note that the graph 
does not contain the full information about the covariance matrices that entered the and 
likelihood calculations, and therefore indicates the goodness of fit only semiquantitatively. 



B. More connection matrixes and degree distributions 

Supplementary Figures |^ to [r| present the results corresponding to Figures ^ and ^ 
(main text) for all food webs and the two models considered. In each figure, the first panel 
shows the connection matrix of the empirical food web in a red box compared to the first 
1 1 random samples obtained from a simulation of the matching model. As in Figure ^ 
each black pixel indicates that the species corresponding to its column eats the species 
corresponding to its row. Diagonal elements correspond to cannibalism. Pixel sizes vary 
due to varying webs sizes. Data are displayed after standardization, a random permutation 
of all species, and a subsequent re-ordering such as to minimize entries in the upper triangle. 
The second panel in each figure displays the corresponding data for the niche model. 

The two bottom panels compare model and empirical degree distributions (model pa- 
rameters as in Table p. As in Figure ||, points denote empirical data, and solid and dotted 
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lines model averages for matching and niche model, respectively; 2(T-ranges are indicated 
in green (matching model) and grey (niche model), olive at overlaps. All model distribu- 
tions were calculated conditional to 5* fixed at the empirical value. 

Since, for the purpose of data standardization, the lowest trophic level is lumped to a 
single trophic species, there is always exactly one "species" that does not consume others. 
As a result, the second point in the cumulative distribution of the number of resources is 
always fixed at (5—1)/ 5*. Because the consumers of this lumped species are the consumers 
of all species that were lumped into it, the number of consumers of the lumped species is 
comparatively large, which leads to a leveling-off at the tails of consumer distributions as 
compared to the distributions for the raw data shown in Ref. [1]. 

The informations provided by the connection matrices and the degree distributions are 
complementary. While the degree distributions give integral information regarding the 
whole web, the connection matrices give an impression of the correlations present between 
individual species as well as the fluctuations in the steady state. 



Stouffer, D. B., Camacho, J., Guimera, R., Ng, C. A. & Amarala, L. A. N. Quantitative patterns 
in the structure of model and empirical food webs. Ecology 86, 1301-1311 (2005). 
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Supplementary Figures 



FIG. 1 : rotated for better resolution, see [Supplementary Figure Legends A for explanations 
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FIG. 2: Benguela Current (see 



Supplementary Figure Legends B 



for explanations) 



Matching model: 




distribution of number of resources distribution of number of consumers 




number of species number of species 
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FIG. 3: Bridge Brook Lake (see 



Supplementary Figure Legends B 



for explanations) 



Matching model: 
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FIG. 4: British Grassland (see 



Supplementary Figure Legends B 



for explanations) 



Matching model: 




distribution of number of resources distribution of number of consumers 




number of species number of species 
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FIG. 5: Canton Creek (see 



Supplementary Figure Legends B| for explanations) 



Matching model: 




distribution of number of resources distribution of number of consumers 




number of species number of species 
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FIG. 6: Caribbean Reef (see 



Supplementary Figure Legends B 



for explanations) 
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FIG. 7: Chesapeake Bay (see 



Supplementary Figure Legends B| for explanations) 
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FIG. 8: Coachella Valley (see [Supplementary Figure Legends B for explanations) 
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FIG. 9: El Verde Rainforest (see [Supplementary Figure Legends B 



for explanations) 
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FIG. 10: Little Rock Lake (see 



Supplementary Figure Legends B 



for explanations) 



Matching model: 
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FIG. 11: Northeast US Shelf (see 



Supplementary Figure Legends B 



for explanations) 




Niche model: 
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FIG. 12: Scotch Broom (see Supplementary Figure Legends B for explanations) 
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FIG. 14: St. Marks Seegrass (see 



Supplementary Figure Legends B 



for explanations) 
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FIG. 15: St. Martin Island (see 



Supplementary Figure Legends B| for explanations) 



Matching model: 
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FIG. 16: Stony Stream (see 



Supplementary Figure Legends B 



for explanations) 



Matching model: 




number of species number of species 
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FIG. 17: Ythan Estuary 1 (see Supplementary Figure Legends B for explanations) 
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FIG. 18: Ythan Estuary 2 (see 



Supplementary Figure Legends B 



for explanations) 



Matching model: 
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